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A METHOD AND SYSTEM FOR EFFICIENTLY EVALUATING AND 



5 DRAWING NURBS SURFACES FOR 3D GRAPHICS 

FIELD OF THE INVENTION 

The field of the present invention pertains to computer implenaented 
graphics. More particularly, the present invention relates to a system and 
10 method for rendering parametric curved surfaces in a three dimensional graphics 
environment. 

BACKGROUND OF THE INVENTION 

Computer graphics are being used today to perform a wide variety of tasks. 
15 Many different areas of business, industry, government, education, 

entertainment, and most recently, the home, are tapping into the enormous and 
rapidly growing Hst of applications developed for today's increasingly powerful 
computer devices. Graphical user interfaces have replaced textual interfaces as 
the standard means for user computer interaction. 

20 

Graphics have also become a key technology for communicating ideas, 
data, and trends in most areas of commerce, science, and education. Until 
recently, real time user interaction with three dimensional (3D) models and 
pseudo-realistic images was feasible on only very high performance workstations. 
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These workstations contain dedicated, special purpose graphics hardware. The 
progress of semiconductor fabrication technology has made it possible to do real 
time 3D animation, with color shaded images of complex objects, described by- 
thousands of polygons, on powerful dedicated rendering subsystems. The most 
5 recent and most powerful workstations are capable of rendering completely life- 
like, realistically lighted, 3D objects and structures. 

In a typical 3D computer generated object, the surfaces of the 3D object 
are described by data models. These data models store "primitives" (usually 

10 mathematically described polygons and polyhedra) that define the shape of the 
object, the object attributes, and the connectivity and positioning data describing 
how the objects fit together. The component polygons and polyhedra connect at 
common edges defined in terms of common vertices and enclosed volumes. The 
polygons are textured, Z-buffered, and shaded onto an array of pixels, creating a 

15 realistic 3D image. 

Many applications require the generation of smooth stufaces and smooth 
curves. To realistically generate a real-world image, the surfaces of objects in the 
image need to be reahstically modeled. The most common representation for 3D 
20 surfaces are "polygon meshes." A polygon mesh is a set of connected, polygonal 
bounded, planar surfaces. Open boxes, cabinets, and building exteriors can be 
easily and naturally represented by polygon meshes. Polygon meshes, however, 
are less easily used to represent objects with curved surfaces. 



25 Referring now to prior art Figure lA, a simple polygon mesh of a section 10 

of a patch is shown, while prior art Figure IB shows an actual section 11 of a 
patch. Each polygon of the section 10 is a mathematical representation of a 
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corresponding portion of the surface of the section 11. The interconnecting 
vertices (e.g., vertices 13) and the interconnecting edges of the polygons (e.g., 
edges 12) collectively define the surface of section 10 in 3D space. There are 
obvious short comings, however, in the accuracy of the section 10 in comparison 
5 to actual section 11. The polygons model the surface of section 11, but the 
representation is only approximate. The "errors" of the approximation can be 
made arbitrarily small by using more and more polygons to create an increasingly 
accin-ate piecewise linear approximation. 

10 Referring now to prior art Figure 2A, prior art Figure 2B, and prior art 

Figure 2C, an initial representation 21, an intennediate representation 22, and a 
final representation 23, are shown respectively. The initial representation 21 is a 
polygon mesh of a rain drop. As described above, initial representation 21 consists 
of a fewer number of polygons, leading to a "blocky," or geometrically aliased, 

1 5 representation of the rain drop. It should be noted that the polygons of initial 

representation 21 are comprised of triangles, whereas the polygons of section 10 
are comprised of quadrilaterals. Regardless of the natiu-e of the polygon primitive 
(triangle, quadrilateral, and the like) used to model a curved surface, the general 
properties, e.g., geometric aUasing, are substantially the same. 

20, 

In proceeding firom initial representation 21 to intermediate representation 
, 22, and to final representation 23, the nimiber of polygons in the polygon mesh 
modehng the rain drop are greatly increased. The final representation 23 contains 
several orders of magnitude more polygons than initial representation 21. This 
25 jrields a much more accurate piecewise linear approximation of the rain drop. 
Computer graphics engineers rely upon a techniques for modeling and 
manipulating the curved stxrfaces of the objects (e.g., the rain drop of final 
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representation 23). For engineering and computer aided design applications, it is 
often highly advantageous to have as accurate a model as possible. 

For example, engineers modeling the surface of the exterior body of an 
5 automobile, or modehng a new mechanical part for a jet engine, desire models 
which are as accurate as possible. The models should enable accurate 
visuahzation of their designs. The models should also be easily manipiilated and 
transformed, allowing engineers to alter a design, visualize the result of the 
alteration, and subsequently alter the design again. 

10 

While modehng objects with curved surfaces can be accomplished using 
polygon meshes having very large numbers of polygons, such models are 
computationally very vmwieldy. If an object is stored as a large polygon mesh, the 
demands upon the data transfer bandwidth of the workstations, the demands 

15 upon the processor power, demands upon memory for storing such models, and 
other such problems, combine to make fast easy intuitive 3D interaction and 
iterative design alteration very slow (if not impossible). As a result, the 3D 
workstation industry utilizes mathematical parametric representations of the 
curved surfaces of an object to define and model the object. Parametric models 

20 also allow an artist or designer to easily manipulate the global shapes of surfaces 
through a small number of interactions. Polyhedral models are computationally 
veiy intensive, requiring many vertices to be repositioned for even a minor change 
in the overall shape of the modeled object. 

25 In a parametric representation, the curves of an object (e,g,, the rain drop 

of Figure 2) are defined mathematically using equations. The equations 
collectively form what is referred to as a parametric representation of the object, 
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specifically, the ctirved surfaces of the object. Parametric representations 
overcome the problems posed by a polyhedral representation (e.g., a polygon 
mesh). The actual shape of the object can be very closely approximated using a 
parametric representation. This representation consimies several orders of 
5 magnitude less space than an equivalent polyhedral representation 

Thus, for example, the rain drop of Figure 2C can be defined and stored as a 
parametric representation. This representation can be easily manipulated, easily 
altered, and easily transformed, among other advantages, in comparison to the 
10 polyhedral representation (e.g., final representation 23). 

Perhaps the most widely used form of curved surface parametric 
representation are non-uniform rational B-spline representations (NURBS). 
NURBS are widely used in the computer aided design (CAD) industry to model 

15 curved siu-faces. Among their many benefits, NURBS have two important 
advantages relative to other forms of parametric representation. The first 
advantage is that they are invariant under projective transformations. The 
second advantage is that they can very accurately define virtually any curved 
surface. Thus, NURBS are widely used in the field of 3D CAD. While parametric 

20 representations of curved surfaces and NURBS are discussed herein, the 
mathematical form and characteristics of curved surface parametric 
representation and NURBS are well known in the art. Those desiring a more 
extensive description of curved surface parametric representation and NURBS 
should consult "Gerald Farin, CURVES AND SURFACES FOR COMPUTER 

25 AIDED DESIGN, ISBN 0-12-249051-7" which is incorporated herein as 
backgroxmd material. 



SGI15-4-453.00 



5 



April 21, 1997 



The problem with NUEBS, however, is the fact that the NURBS model is 
not "natively" rendered by the dedicated rendering hardware of prior art 3D 
graphics workstations. The rendering hardware of typical 3D graphics 
workstations are designed and optimized to function with polygons. The rendering 
5 algorithms, the special purpose rendering hardware, and the display hardware are 
designed to manipulate and display images generated from polygonal graphics 
primitives. Thus, the NURBS models need to be transformed using software into 
polygon meshes prior to rendering. This software executes on the host 
processor(s) or graphics co-processor(s), operates on the NURBS model, and 

10 creates a resulting polygon mesh stored in the memory of the 3D graphics 
workstation. The resulting polygon mesh is then rendered by the dedicated 
rendering hardware into a 3D image on a display. When an engineer makes a 
change to the NURBS model, the parameters of the NURBS model are changed, a 
new resulting polygon mesh is created, and the changed image is subsequently 

15 displayed. 

This prior art rendering process, however, is slow. The specific rendering 
software which transforms the NURBS model into the restilting polygon mesh is 
computationally very expensive. Large numbers of clock cycles are constimed by 
20 the specific rendering software. This greatly detracts from the desired 

"interactive" quality of modem CAD. Additionally, other programs executing 
concTirrently on the workstation are adversely impacted. 

The prior art rendering process makes very large demands on the data 
25 transfer bandwidth of the 3D graphics workstation. The specific rendering 

software executes on the host processor(s) or graphics co-processor(s) and needs 
to move large amounts of data across the busses of the 3D graphics workstation 
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(e.g., transfers to and from main memory or from graphics co-processors to 
polygon rasterizeKs)). This also adversely impacts other programs and processes 
nmning on the 3D graphics workstation. 

In addition, the prior art rendering process does not utiHze the dedicated 
rendering hardware of the 3D graphics workstation. Modem 3D graphics 
workstations often include very sophisticated and very extensive rendering 
subsystems. These subsystems are purposely designed to accelerate the 3D 
graphics rendering process. However, the specific rendering software, as described 
above, executes on the host processor(s) or graphics co-processor(s) and does not 
make use of the dedicated rendering hardware until the resulting polygon mesh is 
completely computed. 

Thus, what is needed is a method which radically speeds up the 3D graphics 
rendering process. The method should greatly speed the process of rendering 
NURBS models. What is further required is a method which accurately renders 
NURBS models and does not unduly burden the data transfer bandwidth of the 3D 
graphics workstation, or constmie an inordinate amoimt of host processor or 
graphics co-processor clock cycles. The method of the present invention provides 
a novel solution to the above requirements. 
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STTMMARY OF THE INVENTION 

The present invention provides a method and a system which greatly 
increases the speed of the graphics rendering process. The method and system of 
the present invention increases the speed and efficiency of the process of 
5 rendering NURBS models. The method and system of the present invention 
accurately renders NURBS models without burdening the data transfer 
bandwidth of the computer system. In addition, the method and system of the 
present invention does not consume an inordinate amount of host processor or 
graphics co-processor clock cycles. In so doing, the present invention provides a 
10 fast and efficient process for rendering NURBS models. 

In one embodiment, the present invention comprises a computer 
implemented process and system for rendering curved lines and surfaces as 3D 
graphics on a display. The system of the present invention includes a computer 

15 system having a processor, a bus, and a 3D graphics rendering pipeline. The 
curves and surfaces are modeled by non-uniform rational B-splines (NURBS). 
The process of the present invention functions by receiving a NURBS model for 
rendering from a software program running on the host processor. The NURBS 
model defines a curve or siirface. The process of the present invention efficiently 

20- converts the NURBS model to a Bezier model using the hardware of the graphics 
rendering pipeline. The Bezder model describes the same curve or stirface. The 
■ process of the present invention subsequently generates a plurality of points on 
the curve or surface using the Bezier model and the graphics rendering pipeline. 
The points on the siirface are then used by the graphics rendering pipeline to 

25 render the curve or surface defined by the Bezier model. In so doing, the present 
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invention provides a fast and efficient process for rendering NURBS models, 
hence, greatly increasing the speed of the graphics rendering process. 

In another embodiment, the present invention renders a curve or sxirface 
directly from a NURBS model defining the curve or surface. In this embodiment, 
the present invention uses the graphics rendering pipeUne to directly evaluate the 
NURBS control points of the NURBS model into points on a curve or surface. 
The points on the curve or surface are then used by the graphics rendering 
pipeline to render the curve or surface, in the manner described above. 

Additionally, the present invention includes a method for automatically 
generating normals (e.g., vectors normal to the surface) for a surface defined by a 
NURBS model. After points on the surface have been generated, in the manner 
described above, the present invention uses the graphics rendering pipeline to 
generate normals for the siuface. The normals are subsequently used to render 
realistic lighting and shading effects onto the surface. 
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iRBTTT.F DESCRIPTION OF THE D RAWINGS 

The present invention is illustrated by way of example and not by way of 
limitation, in the figures of the accompanying drawings and in which Uke reference 
numerals refer to similar elements and in which: 

5 

Prior Art Figure LA shows a simple polygon mesh of a section of a patch of 
the prior art. 

Prior Art Figure IB shows an actual section of the patch of the prior art. 

10 

Prior Art Figure 2A shows an initial representation polygon mesh of the 
prior art. 

Prior Art Figure 2B shows an intermediate representation polygon mesh of 
15 the prior art. 

Prior Art Figure 2C shows a final representation polygon mesh of the prior 

art. 

20 . Figure 3 shows a graphics rendering system in accordance with one 

embodiment of the present invention. 

Figure 4 shows a block diagram of a geometry subsystem and a raster 
subsystem in accordance with one embodiment of the present invention. 

25 
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Figure 5 shows a block diagram of a tri-linear interpolator in accordance 
with one embodiment of the present invention. 

Figure 6 shows a geometric spatial representation of fotar control points of a 
Bezier curve. 

Figure 7A shows the four control points from Figure 6 plugged into the 
inputs of the tri-linear interpolator from Figure 5 in accordance with one 
embodiment of the present invention. 

Figure 7B shows a geometric spatial representation of the four Bezier 
control points, their respective intermediate control points, and the resulting point 
on the curve in accordance with one embodiment of the present invention. 

Figure 7C shows a curve defined by the Bezier control points from Figure 

7B. 

Figure 8A shows the tri-linear interpolator from Figure 5 with Bezier control 
point inputs in blossom notation in accordance with one embodiment of the 
present invention. 

Figure 8B, shows a geometric spatial representation of the four Bezier 
control points and intermediate control points, in blossom notation, in accordance 
with one embodiment of the present invention. 

Figure 9A shows the tri-linear interpolator from Figure 5 having NURBS 
control point inputs in accordance with one embodiment of the present invention. 
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Figure 9B shows a geometric spatial representation of a NURBS to Bezier 
conversion in accordance with one embodiment of the present invention. 

5 Figure 10 shows a table iUustrating the relationship between the LERP 

weights of a tri-Unear interpolator and the resulting Bezier control points in 
accordance with one embodiment of the present invention. 

Figure llA shows a pluraHty of look up tables used in an alternate 
10 embodiment of the present invention. 

Figure IIB shows a tri-linear filter which implements the direct NURBS 
evaluation. 

15 Figure 12A shows a diagram of the process of automatic normal generation 

in accordance with the present invention. 

Figure 12B shows a diagram of an optimized process of automatic normal 
generation. 

20 

Figure 13A shows a tri-linear interpolator having inputs implementing 
surface partial evaluation in accordance with the present invention. 

Figure 13B shows a spatial representation of the surface partial evaluation 
25 process. 
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Figure 14 shows a flowchart the steps of a method in accordance with one 
embodiment of the present invention. 

Figure 15 shows a computer system upon which the present invention may 
be practiced. 



SGI15-4-453.00 



13 



April 21, 1997 



nFTATT.ED DRSnRTPTIQN OF THE IN VENTION 

In the foRowing detailed description of the present invention, a method and 
system for efficiently evaluating and drawing NURBS surfaces for 3D graphics, 
numerous specific details are set forth in order to provide a thorough understanding of 
the present invention. However, it wiU be obvious to one skilled in the art that the 
present invention may be practiced without these specific details. In other instances 
well known methods, procedures, components, and drcuits have not been described in 
detail as not to unnecessarily obscure aspects of the present invention. 



10 NOTATION AND NOMENCTATURE 

Some portions of the detailed descriptions which follow are presented in terms of 
procedures, logic blocks, processing, and other symbcHc representations of operations 
on data bits within a computer memory. These descriptions and representations are 
the means used by those skilled in the data processing arts to most effectively convey 

15 the substance of their work to others skiUed in the art. A procedure, logic block, 

process, step, etc., is here, and generally, conceived to be a self-consistent sequence of 
steps or instructions leading to a desired result. The steps are those requiring 
physical manipulations of physical quantities. Usually, though not necessarily, these 
quantities take the form of electrical or magnetic signals capable of being stored, 

20 transferred, combined, compared, and otherwise manipxilated in a computer system, 
it has proven convenient at times, principally for reasons of common usage, to refer to 
these signals as bits, values, elements, symbols, characters, terms, numbers, or the 
like. 

25 It should be borne in mind, however, that all of these and similar terms are to be 

associated with the appropriate physical quantities and are merely convenient labels 
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applied to these quantities. Unless specifically stated otherwise as apparent firom the 
following discussions, it is appreciated that throughout the present invention, 
discussions utihzing terms such as "rendering" or "processing" or "computing" or 
"converting" or "generating" or "evaluating" or "drawing" or the like, refer to the action 
5 and processes of a computer system (e.g., computer system 1500 of Figure 15), or 
similar electronic computing device, that manipulates and transforms data 
represented as physical (electronic) quantities within the computer system's registers 
and memories into other data similarly represented as physical quantities within the 
computer system memories or registers, or other such information storage, 
10 transmission, or display devices. 

In one embodiment, the present invention comprises a computer 
implemented process and system for rendering curved surfaces as 3D graphics on 
a display. The system of the present invention includes a computer system 

15 having a processor, a bus, and a 3D graphics rendering pipeline. The surfaces are 
modeled by non-uniform rational B-spHnes (NURBS). NURBS are well known and 
are widely used as mathematical parametric models of 3D objects. The process of 
the present invention functions by receiving a NURBS model for rendering from a 
software program running on the host processor. The NURBS model defines a 

20 surface. The process of the present invention efficiently converts the NURBS 
model to a Bezier model using the hardware of the graphics rendering pipeline. 
Bezier models are similarly well known, and are also used as parametric models of 
3D objects. The Bezier model describes the same surface. The process of the 
present invention subsequently generates a plurality of points on the curve or 

25 surface using the Bezier model and the graphics rendering pipeline. The points on 
the curve or surface are then \ised by the graphics rendering pipeline to render the 
curve or surface defined by the Bezier model. 
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In this maimer, the present invention provides a method and a system 
which greatly increases the speed of the graphics rendering process. The method 
and system of the present invention increases the speed and efficiency of the 

5 process of rendering NURBS models. The method and system of the present 

invention accurately renders NURBS models without burdening the data transfer 
bandwidth of the computer system. In addition, the method and system of the 
present invention does not consume an inordinate amount of host processor or 
graphics co-processor clock cycles. In so doing, the present invention provides a 

10 fast and efBcient process for rendering NURBS models. The present invention 
and its benefits are described in detail below. 



Figure 3 shows a graphics rendering system in accordance with the present 
invention. Graphics rendering system 300 includes a graphics rendering pipeHne 

15 301 coupled to a central processing unit (CPU) subsystem 302. The graphics 
rendering pipeline is comprised of dedicated specialized graphics rendering 
hardware and is designed to greatly accelerate the process of rendering 3D 
graphics. The graphics rendering pipeUne 301 includes a geometry subsystem 
303, a raster subsystem 304, and a display subsystem 305, each coupled in 

20 pipeline fashion to CPU system 302. 

As is well known in the art, the process of creating realistic images involves 
a number of stages arranged in the pipeUne fashion as shown in graphics pipeline 
301. The process of creating images firom models is referred to as rendering, and is 
25 described below. Those desiring a more detailed discussion of the rendering process 
and a graphics rendering pipeHne should consult U.S. Patent Number 5,230,039 of 
Mark S. Grossman, Kurt B. Akeley, and Robert A. Drebin, which is incorporated 
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herein as background material. Each subsystem (e.g., geometry subsystem 303) 
implements a stage of the rendering process. The subsystems of graphics pipehne 
301 include dedicated special purpose graphics rendering logic implemented in 
hardware to greatly speed the rendering process. 

5 

With reference still to Figure 3, the CPU subsystem 302 runs the 
application program which creates the models and responds to and interacts with 
the user. The models are stored as database structures maintained in main 
memory. The CPU subsystem 302, running the appHcation program, traverses 
10 the model database and sends high level graphics data to the graphics rendering 
pipeline 301 for rendering. 

The geometry subsystem 303 transforms and clips the graphics data from 
the models into screen coordinates. The geometry subsystem includes a plurality 
15 of floating point processors organized to perform geometric transformation 

calculations in parallel. Each floating point processor is referred to as a geometry 
engine. The geometry subsystem 303 further includes an input FIFO buffer to 
receive the graphics data from the CPU subsystem 302. 

20 The raster subsystem 304 breaks points, lines and polygon data (e.g., 

graphics primitives) into pixels. This process is referred to as rasterization. The 
raster subsystem 304 performs the calculations required to reduces the graphics 
primitives received from the geometry subsystem to individual pixels. Each pixel 
is assigned an x, y screen position, plus a nxmaber of parameters per pixel, 

25 including z, red blue, alpha, and texture parameters s and t. These parameters 
are linearly interpolated between the vertices and the edges of the primitives (e.g., 
polygons). 
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The raster subsystem also computes visibility and writes ^«1 data to a 
frame buffer. The raster subsystem 304 takes pixel data generated by raster 
subsystem 304 and selectively updates the image and z bit planes of the frame 

5 buffer, using the results of z comparisons and alpha blending. The raster 
subsystem also performs antialiasing. Additionally, raster subsystem 304 
includes a memory for storing "texels", or texture map elements. Each texel is 
composed of r, g, b, and alpha fields. Based upon the s and t parameters, raster 
subsystem 304 maps a textured pixel color onto each pixel stored in the frame 

10 buffer. 

The display subsystem displays the contents of the frame buffer on a 
computer display monitor. The display subsystem receives the pixel information 
from the frame buffer, routes it through digital to analog converters, and displays 
15 the resulting analog signal on a color monitor. The color monitor displays each 

pixel of the rendered image in accordance with the data describing each respective 
pixel and in the specified image format (e.g., 1600x1200 pixels). 

With reference now to Figure 4, a block diagram of the geometry subsystem 
20 303, the raster subsystem 304 in accordance with present invention is shown. 
The geometry subsystem 303 includes a FIFO buffer 306 coupled to receive 
graphics data via bus 313. As described above, geometry subsystem 303 
performs geometry transformations, as represented by transformer unit 307. 
The raster subsystem 304 includes logic which rasterizes the primitives received 
25 from geometry subsystem 303 and performs the texture mapping computations. 
This logic is implemented in rasterizer unit 308 and texture mapping unit 309. 
Texture memory 310 stores the individual texture map elements (i.e., texels) and 
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provides them to texture map unit 309. The raster subsystem 304 includes 
Hghting and shading unit 311 coupled to receive the pixel data from the texture 
map unit 309. After filtering and blending operations for Ughting, sha<^g, and 
antialiasing, the pixel data is stored in a frame bujBFer 312 and subsequently 
output to display subsystem 305 via bus 314. 

The method of the present invention functions by using the standard 
graphics pipehne hardware to render NURBS surfaces. The process of the 
present invention utihzes the texture mapping unit 309 to directly render NURBS 
models. In one embodiment, the process of the present invention efficiently 
converts the NURBS model to a Bezier model using the hardware of the graphics 
rendering pipehne. The Bezier model describes the same curved surface. The 
process of the present invention subsequently generates a pluraUty of points on 
the surface using the Bezier model and the graphics rendering pipehne. 

As described above, prior art methods do not natively render NURBS 
models with the dedicated rendering hardware (e.g., graphics pipeline 301). The 
NURBS models are transformed into polygon meshes by software executing on 
the host processor (e.g., CPU subsystem 302). The polygon mesh is stored m a 
data base structure in main memory and traversed by the CPU subsystem 302 in 
accordance with the software. The resulting high level graphics data stream is 
transmitted to FIFO 306 and the image is rendered "conventionally" in the 
manner described above. 

The present invention, however, utiUzes the graphics rendering pipehne 301 
to implement a method of natively rendering NURBS models. The raster unit 
308, texture mapping unit 309, and texture memory 310 (each included within 
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raster subsystem 304) are designed and optimized for processing conventional 
graphics primitives and texel data. The present invention utilizes this existing 
hardware to natively render NURBS models without first converting them to 
polygon meshes in CPU subsystem 303. 

5 

With reference now to Figure 5, a block diagram of a tri-linear interpolator 
500 is shown. Tri-linear interpolators (e.g., tri-linear interpolator 500) are utihzed 
extensively within texture mapping unit 309. As is well known in the art, tri-Hnear 
interpolator 500 includes seven linear interpolators 501 through 507. Linear 

10 interpolators 501, 502, and 503 form a bilinear interpolator. Two bi-linear 

interpolators combined with a common interpolator (e.g., linear interpolator 507) 
form a tri-linear interpolator. Bilinear interpolation is used to interpolate texels in 
a 2D image. For example, linear interpolator 501 interpolates between input iO 
and input il. An interpolation ratio is determined by a "weight" . An interpolation 

15 weight closer to 0.0 results in an interpolation output closer to iO, while and 

interpolation weight closer to 1.0 results in an interpolation output closer to il. 
Each linear inteipolator is typically referred to as a "LERP", each interpolation 
weight is referred to as a "LERP weight", and hence, tri-hnear interpolator 500 is 
refeiTed to as a "LERP tree". Quantitatively, the function of each linear 

20 interpolator can be expressed as: ^^"11 + (1-t) * iO, where t equals the LERP 
weight. 

As is well known in the art, bilinear interpolation is conventionally used to 
interpolate between four texels to produce a blended color for a fragment. Four 
25 texels, which surroimd the s and t texture coordinates of interest, are used to 

generate the texture color of each fragment. Two pairs of texels, arranged in a 2x2 
array, are each pairwise interpolated horizontally to generate two new 
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interpolates. These interpolates are then interpolated vertically to generate the 
final filtered texture color (using interpolator 505), 

Tri-linear interpolation is used to interpolate between texels at different 
5 scales (i.e., texel screen projection size). Thus, as is well known in the art, tri- 
Unear interpolation interpolates between the output of two bilinear interpolators. 
Hence, the output of interpolator 505 and the output of interpolator 506 is 
interpolated (by interpolator 507) to obtain a filtered color for the firagment. Thus, 
tri-linear interpolator 500 is often refeired to as a filter. Tri-linear interpolators 
10 (e.g., tri-linear interpolator 500) are utilized extensively in the hardware of texture 
mapping unit 309. 

The process of the present invention, however, uses tri-Unear interpolator 
500 to process NURBS or Bezier control points, instead of texels. First, in the 

15 case of a Bezier curve, the process of the present invention uses tri-Unear 

interpolator 500 to generate points on the surface defined by the Bezier model. 
The Bezier control points define a surface. Once the points on the surface are 
generated, they are feed back into FIFO 306 as graphics primitives and 
subsequently rendered into the .surface defined by the Bezier model. The process 

20 of rendering the points on the surface into an image, occurs in the graphics 
rendering pipeKne, in the manner described above. Thus, the process of the 
present invention renders the Bezier model into a surface without using the CPU 
subsystem (e.g., CPU subsystem 302) or data transfer bandwidth between main 
memory and the CPU subsystem or graphics co-processor. 

25 

Referring now to Figure 6, a spatial representation of four control points of 
a Bezier curve is shown. The shape of a Bezier curve is defined by the control 
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points bO, bl, b2, and b3. The control points bO, bl, b2, and b3 are shown in what 
is referred to in the art as a "de Casteljau form", thus, the spatial relationship 
between the four control points and the Unes connecting them as shown. The 
basic recursive de Casteljau algorithm is stated as: 

5 

Equation 1 

hf = b,^"^ * (u+1) -h bJi - ui 

where bf = bi 

10 The term u is a parametric value and i and k are both indices of the recursion. 

The process of the present invention utilizes tri-linear interpolator 500 to 
evaluate the Bezier curve defined by the control points bO, bl, b2, and b3. The 
process of the present invention implements the de Casteljau algorithm in tri- 
15 linear interpolator 500. 

Referring now to Figure 7A, the four control points bO, bl, b2, and b3 are 
shown plugged into the inputs of tri4inear interpolator 500. Each of interpolators 
501-504 produces a respective intermediate control point as shown (e.g., resulting 

20 point blO shown beneath interpolator 501). The intermediate control points are 
subsequently interpolated vising interpolators 505-507 to obtain a resulting point 
on the curve (e.g., b30 from interpolator 507). In this manner, the linear 
interpolators of tri-linear interpolator 500 are used to generate a series of points 
on the curve. Each "pass" through tri-Hnear interpolator 500 yields one point one 

25 the curve. In accordance with equation 1, by varying the LERP weight used by all 
of tri-linear interpolators 501-507, the output of interpolator 507 yields varying 
points on the curve. The LERP weights numerically range from 0.0 to 1.0, This 
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process of generating points on a curve defined by a cubic polynomial equation 
{e.g., a Bezier curve) in this form is the de Castlejau algorithm, and its execution is 
referred to as evaluation. 

5 Figure 7B shows the spatial relationship of the four Bezier control points 

bO, bl, b2, and b3 along with their respective intermediate control points and the 
resulting point on the curve, b30. As described above, each pass through tri-linear 
interpolator 500 produces a point on the curve. The location of this point is 
determined by the LERP weights used in each of interpolators 501-507. For 

10 example, Figure 7B shows the resulting point on the curve (e.g., b30) in the case 
where the LERP weights are equal to 0.5. It should be noted that a IxERP weight 
at one half the numerical range (e.g., 0.5) yields a point on the curve located at the 
mid point of the curve. Thus, a pass through tri-linear interpolator 500 using a 
LERP weight of 0.25 yields a resulting point on the curve located one quarter of 

15 the way along the length of the curve. In this manner, successive passes are 
made through tri-Unear interpolator 500 using differing LERP weights (e.g., 0.0, 
0.25, 0.5, 0.75, and the like) yielding a series of points on the curve, and ultimately, 
a series of points on the surface. 

20 Figure 7C shows a curve 704 defined by Bezier control points bO, bl, b2, 

and b3. As described above, the resulting point b30 on the curve 704 is located at 
the midpoint of the curve. As successive passes are made through tri-linear 
interpolator 500 with differing LERP weights, differing points on curve 704 are 
defined. Using these resulting points, curve 704 is rendered by the graphics 

25 rendering pipeline 301 (shown in Figure 3). The resulting points are feed back into 
FIFO 306 as graphics primitives and subsequently rendered into the curve defined 
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by the Bezier model. The rendering process subsequently proceeds in the graphics 
rendering pipeline, in the manner described above. 



10 



25 



In this manner, the process of the present invention evaluates and renders 
Bezier curves. Bezier curves are directly evaluated in raster subsystem 304. By 
using the dedicated tri-hnear interpolators of texture mapping unit 309 (shown in 
Figure 4), the process of the present invention implements in fast hardware what 
was conventionally implemented in software on either the CPU subsystem or 
graphics co-processor, thus greatly accelerating the rendering process. 



With reference now to Figure 8A and Figure 8B, the tri-linear interpolator 
500, and four Bezier control points and intermediate control points, are 
respectively shown. The four Bezier control points and intermediate control points 
are shown xising blossom notation. Blossom notation comprises another form of 
15 representing the de Casteljau algorithm and is weU known in the art. The terms of 
the blossom notation represent the repeated Unear interpolations carried out by 
tri-linear interpolator 500. Particularly, the blossom form is defined as follows: 

Eauation 2 



20 abu = abc * (1-u) + abd *u 

where a, b, c, and d are control point values and u is an arbitrary parametric 
variable. Thus, Figures 8A and SB are the same as Figures 7A and 7B except for 
the fact that they are illustrated using the blossom form. 



Accordingly, in order to evaluate and render NURBS models, the NURBS 
models first need to be translated into Bezier models. Once the NURBS model is 
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translated into a Bezier model, the process of the present invention evaluates and 
renders the Bezier model in the manner described above. The blossom form 
provides an efficient way of describing NURBS curves. The present invention 
translates NURBS models into Bezier models to facilitate fast and efficient 
5 evaluation. Bezier models are inherently weU suited to relatively fast evaluation 
using tri-hnear interpolators. However, NURBS models represent useful shapes 
more efficiently than Bezier models. Hence, NURBS models are more widely used 
in the CAD field. 

10 Figure 9A shows the tri-linear interpolator 500 and Figure 9B shows a 

NURBS to Bezier conversion. Control points 012, 123, 234, and 345 comprise 
NURBS control points for a NURBS model. These NURBS control points are fed 
into tri-hnear interpolator 500 in the manner shown in Figure 9A. Each pass 
through tri-linear interpolator 500 yields a Bezier control point at the output bk- 

15 Thus, four passes are required to yield Bezier control points 222, 223, 233, and 
333. Generally, the four Bezier control points define a curve equivalent to that 
defined by the four NURBS control points 012, 123, 234, 345. In a manner 
similar to the Bezier evaluation described above, the LERP weights of each of the 
interpolators are varied to obtain the desired Bezier control point. The LERP 

20 weights are varied in accordance with a table shown in Figure 10. 

Figure 10 shows a table (e.g., table 1) illxistrating the relationship between 
the LERP weights of tri-hnear interpolator 500 and the resulting Bezier control 
points. Thus, in accordance with table 1, to obtain Bezier control point 222, the 
25 LERP weights for the particular interpolators are as shown in the 222 coliimn, to 
obtain Bezier control point 223, the LERP weights for the particular interpolators 
are determined via the terms shown in the 223 column, and so on. In this manner, 
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the process of the present invention determines Bezier control points 222, 223, 
233, and 333 from the NURBS control points 012, 123, 234, and 345. In the 
present embodiment, table 1 is computed on the CPU subsystem (e.g., CPU 
subsystem 302) ahead of time in double precision. 

It should be appreciated that the equations shown in table 1 resvilt from the 
geometric relationship between the control points of NURBS curves and the 
control points of Bezder curves. The equations shown in table 1 can be derived 
directly form the symmetry of the blossom function. For example, in the cubic 
case, going from B-spline control points A, B, C, D, and knot values a, b, c, d, e, and 
f, Bezier control points A', B', C, and D' are derived using the following equations: 

B' = ((c-b)/(e-b)) * (C-B) + B 
C' = ((d-b)/(e-b))*(C-B) + B 
X = ((c-a)/(d-a)) * (B-A) + A 
Y = ((d-c)/(f-c)) * (D-C) + C 
A' = ((c-b)/(d-b))*(B'-X)+X 
D' = ((d-c)/(e-c)) * (Y-C) + C 

where X and Y are temporary variables. 

The above equations implemented in a tri-linear interpolator 500 as shown 
in Figure 8A and Table 1 of Figure 10. Thus, in a hardware implementation, there 
are four separate LERP weights and four control points received from two 
"textures". Thus, the control points would be the NURBS control points taken 
four at a time from a control "mesh", and the four LERP weights would come from 
another texture or lookup table generated from the knot sequence. 
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Before the conversion from NURBS to Bezier, the NURBS curve needs to 
be transformed from the global to the local domain. The present invention 
implements this function by transforming from the global NURBS domain to the 
local Bezier domain. The global to local transformation is required due to the fact 
that, in the present embodiment, the NURBS curve and the Bezier curves 
resulting from the translation process are both defined over the interval [0, 1]. As 
is well known in the art, a NURBS model includes "knot vectors" which, in addition 
to the NURBS control points, define the curve. The transformation from the 
global to the local domain is governed by the equation: 

Equation 3A 



where u equals the b-spline local parametric value, 1? equals the NURBS global 
parametric value, and ^ and ^ are both equal to a knot value from the NURBS 
curve. The present invention solves these terms and subsequently uses them 
with tri-linear interpolator 500 to convert NURBS models into Bezier models. 

The other component of the global to local transform is the association 
between b-spline domain integral and the parameter value, u. As is well known in 
the art, for each value of u, there is a "block index", b, which identijaes the location 
of the set of NURBS (and Bezier) control points which affect the surface shape at 
that parameter value, u. 

Equation 3B 




^ /s — \ 

b<u<b+ 1 
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This equation can be encoded in a texture map or lookup table, in a fashion very 
similar to the encoding of the global to local parameter transformation described 
above. 

5 

Thus, after the NURBS curve is converted to a Bezier curve, the present 
invention evaluates and renders the Bezier curve in the manner described above. 
It should be appreciated that the conversion process takes place within the 
graphics pipeline hardware. As such, the present invention does not place undue 
10 data transfer bandwidth loads on the computer system. 



It should be appreciated that each b spline domain interval defines an 
independent Bezier curve. To be more precise, for each B-spline domain interval 
with a sufBcient number of knots to either side, the B-spline domain interval 
15 defines an independent Bezier curve. B-spline domain segments near the 
beginning and the end of a B-spline control point list do not have a sufficient 
number of knots or control points defined to determine a Bezier curve. A. cubic B- 
spiine curve with n vertices generates up to n-3 Bezier carves, generating 4(n-3) 
Bezier vertices. 

20 

Now consider the case of a Bezier surface patch. Points are evaluated on a 
surface in two stages. The Bezier patch consists of a 4x4 array of points. The 
present invention needs to evaluate at point 'T" with coordinates of (s, t). In the 
first stage, the four rows of the patch are considered as Bezier curves, and 
25 evaluated at the s coordinate of each of these curves. In the second stage, the 
resulting four points from the first stage are treated as four points of a single 
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Bezier curve. The present invention evaluates this curve at coordinate t, thereby- 
obtaining the desired point P. 

With reference now to Figure llA and Figure IIB, a process in accordance 
5 with an alternative embodiment of the present invention is shown. In this 

alternate embodiment, the NURBS model is directly rendered into a surface, as 
opposed to first being converted into a Bezier model and subsequently being 
rendered as described above. This embodiment implements a "Mansfield 
algorithm". 

10 

Figure llA shows a pluraHty of look up tables 1150. The look up tables 
1150 are indexed with texture parameters s, and t. These parameters are 
mapped by look up tables 1150 into a series of LERP weights and a plurality of 
NURBS control points implementing a global to local transformation. 

15 

Figure IIB shows a tri-linear filter 500 which implements the direct 
NURBS evaluation. Tri-linear filter 500 includes two rows of inputs. The first 
rov/ of inputs is comprised of NURBS control points A, B, C, D, E, F, G, and K. 
The second row of inputs is comprised of NURBS control points I, J, K, L, M, N, 0. 

20 and P. Each of the hnear interpolators 501-507, as described above include a 

LERP weight parameter. The LERP weight parameters are shown as al, a2, a3, 
a4, a5, a6, and a7, respectively. In this embodiment, the output of interpolator 
507 is coupled to a register and to one input of an interpolator 1160. The register 
is also coupled to interpolator 1160 via its second input. Interpolator 1160 uses 

25 LERP weight parameter a8. The output of interpolator 1160 is bk, a point on the 
surface. When used as described above, the tri-Unear interpolator 500 performs 
what is referred to as a "separable convolution". 
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The process of the present embodiment evaluates the surface s(u,v) by 
implementing the following equation: 



20 



5 Equation 3C 

S(u,v) = SINf (u)Nf (v)dij 
i j 

Nr^(u) Nj;l(u) 

for U, Ni (u) = (U - Ui) — — + (Ui+r+l - u) . , „. , 

whereNi=|o 

Note that there is an equivalent expression for v. 
10 Additionally, note that dij are the local NURBS control points for the 

parameter value (u,r). The terms u and v are parameters of the surface, i and j 
are indices of the control points, and r is an index of the recursion. 

By implementing equation 3C, the process of the present embodiment is able to 
15 directly evaluate a NURBS model into a surface. This process does not require 
the prior conversion of the NURBS model into a Bezier model. In order to 
implement equation 3C, the LERP weights (al-a8) are loaded as described below. 



In the general case, (e.g., ID) a convolution equation is expressed as: 



Equation 3D 

m 

C(x)= XK(h)I(x-h) 
h=0 



Where C denotes the convolved result at point x, I is the input image, and K being 
25 the m convolution kernel coefficients. Given equation 3D, the present invention 
derives the alpha values (i.e., LERP weights) used by the LERPS in the tri-linear 
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interpolator 500 based on the kernel values. The present invention extrapolate 
equivalencies between the original kernel coefficients based on going through the 
tri-linear interpolator 500: 



5 The original ID convolve is expressed as: 

koiO+kiii +k2i2+k3i3+k4i4+k5i5+k6i6+k7i7 
Using tri-Hnear interpolator 500, the expression is: 

10 

Equation 3E 

ai^ + ( 1 - a) z\ + (l-b)i^ ci^ + ( 1 - c) di^ + {l-d^ij 

' ' + ' ^ ' ^ V '-H^ . ' 



8 (l-g) 

15 : ' 

h (scale) 



where a though g equal LERP weights and h equals the scale, and ix are the inputs 
of the input image- ^ov the case of convolutions, the last LERP h (using this 
notation) is assigned h= 1,0, 

20 

The last step for convolutions is to scale the output to de-normahze the 
result. We can form equivalencies between the Ks and the alphas by algebraic 
manipulation as follows: 



25 Equations 3F 
kQ = a^e^g'^'h 
ki = (l-a)*e*g*h 
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k2 


= b. (1-e) *g*h 


k3 


= (l-b)*(l-e)*g*h 


k4 


= c*f*(l-g)*h 


k5 


= (l-c)*f*(l-g)*h 


k6 


= d*(l-f)*(l-g)*h 


k7 


= (l-d)*(l-f)*(l-g)*h 



Equations 3F also take into account that the LERP alpha values are 
normalized. Additionally, before alpha values for each LERP are extrapolated, the 

10 present invention also needs to account for possible divide by zeros. These could 
be caused by either very sparse kernel matrices or coefficients that may cancel 
each other out (as is the case for some edge detectors). To compensate for these 
effects, the present invention works with the absolute values of each kernel 
coefficient and compensates by negating one of the inputs when the coefficient is 

15 negative. Given this, the LERP alphas (i.e., LERP weights) are derived as: 



Equations 3G 
|ko 

a = 



ko + ki 



k2 

20 b = 



c = 



|k2| + Iksl 

|k4| 
|k4| + Iksl 

Ikel +|k7| 
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ko + ki 



f= 



|ko| 


+ |ki| 


+ 


1^2! + 


Iksl 




|k4| 


+ 


Iksl 




IM 


+ |k5| 


+ 


kel + 


IM 






+ 


Iksl 





k4 + ks + k6 + k? 



| ko| +|ki| +|k2| +|k3| 

^"|ko| + |ki| + |k2| +|k3| + |k4| +|k5| +|k6| + |k7| 

h = |ko| +|ki| +|k2| +|k3| +|k4| +|k5| +|k6| +|k7|** 

where a, b, c, d, e, f, and g, are LERP weights. The last LERP, h, can be used to 
accumulate the current result with the previous pass. However, because the 
present invention is interpolating as opposed to simple accumulating, the present 
invention needs to account for the contributions of previous passes (i.e., doing an 
internal multi-pass). Thus, the h for pass j would really be represented as: 

Equation 3H 

h,=^^ 

ft(l-hi) 

i=j 

Where j is the current pass, hi is the alpha value for pass i, n is the total number 
of passes, and the sum of | Kj ] is the magnitude (or sum) of all of the kernel 
cdefScients corresponding to pass j. One alternative is to bypass the last LERP, 
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and use the accumulation logic provided in the "pixel pipe" of the graphics pipeline. 
This ensures better precision and leaves the h alpha value as an escpression 
dependent solely on the coefficients of the current pass. Because these values are 
normalized, a pre-processing step that is required with this alternate approach is 
5 that the present invention needs to first normalize the K's before performing the 
conversions and scale back to de-normalize on the last pass. 

As described above, to compensate for negative kernel coefficients, the 
present invention negates the inputs based on the following: 

10 

neg B for LERP a = sign(kO) xor sign(kl) 
neg B for LERP b = sign(k2) xor sign(k3) 
neg B for LERP c = sign(k4) xor sign(k5) 
neg B for LERP d = sign(k6) xor sign(k7) 
15 neg B for LERP e = sign(kl) xor sign(k3) 
neg B for LERP f = sign(k5) xor sign(k7) 
neg B for LERP g = sign(k3) xor sign(k7) 
sign of scale factor = sign(k7) 

20 In addition, the texture filter (e.g., tri-linear interpolator 500) can also swap inputs 
A and B if necessary. For instance, if ko is negative (and ki positive), the A input 
must be negated. So, for the "a" LERP, the inputs are swapped and B negated. 

Referring still to Figures llA and IIB, the present invention uses look up 
25 tables 1150 to implement the global to local transformation. Texture parameters 
s and t are used to index the look up tables 1150 to obtain a set of NURBS control 
points (i.e., the set of NURBS control points A though P) appropriately indexed for 
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the local domain. These NURBS control points comprise a portion of an array of 
NURBS control points, which in turn, comprise the NURBS model. The NURBS 
control points A through P comprise a global to local transformed 4x4 array. 
Additionally, the look up tables yield the LERP weight parameters al through a8. 
5 The LERP parameters are utiUzed by interpolator 500 to evaluate the NURBS 
control points A through P. This causes tri-linear interpolator 500, as shown in 
Figure IIB, to implement convolution. 

The NURBS control points A through P are fed through tri-Hnear 
10 interpolator 500 in two separate stages. In the first stage, NURBS control points 
A through H are fed through, using LERP weight parameters al through al. .The 
output of interpolator 507 (e.g., the first result) is stored in the register. In the 
second stage, NURBS control points I through P are fed through. This time, the 
output of interpolator 507 (e.g., the second result) is coupled to the first input of 
15 interpolator 1160 and the first result is coupled to the second input of interpolator 
1160. The interpolator 1160 combines the first and second results, using LERP 
weight parameter a8, to generate bk, a point on the surface. Thus, tri-linear 
interpolator 500 is utilized to implement a quadri-linear inteiTpolator as shown in 
Figm^e IIB. Again, it should be noted that tri-linear interpolator 500, in the above 
20 manner, computes a convolution of the NURBS control points. 

Hence, the present invention generates a plurality of points on the surface. 
These points are fed back through the graphics rendering pipeline and are 
subsequently rendered into a resulting surface. Thus, the present invention, in 
25 accordance with this alternate embodiment, directly renders a surface firom a 

NURBS model. As with the previous embodiment, the NURBS model is rendered 
using the graphics rendering pipeline (e.g., graphics rendering pipeline 301). 
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With reference now to Figure 12A and Figure 12B, a diagram 1200 and a 
diagram 1201 are respectively shown. Diagram 1200 and diagram 1201, in 
accordance with the present invention, represent the processes for implementing 
automatic normal generation. In diagram 1200, block 1202 is the global to local 
domain transformation, block 1203 is the evaluation process of generating points 
on the curve, as described above. Blocks 1204 and 1205 are the Su and sy partial 
generation processes. From these partials, the Su and Sy tangents are derived in 
blocks 1206 and 1207. From these tangents, the normals to the surface are 
derived in block 1208. In diagram 1201, the points on the curve are determined, 
as described above, in block 1209. The Su partials are determined in block 1210. 
From these, the sv partial/normals are determined in block 1211. It should be 
noted that the su and sv partials can be computed in conjimction with either 
surface evaluation embodiment described above (e.g., evaluation via the NURBS 
to Bezier conversion process, or direct NURBS evaluation via the convolution 
process). 

Diagram 1200 generates analytically correct normals by computing the 
cross product of u-tangents and v tangents. The u-partials are defined by the 
equation: 

Equation 4 

Su = [xu yu Zu Wul'^ 
where Su and sy are the surface partials. 

The partials may be computed using either method (i.e., either the NURBS to 
Bezier conversion or the Mansfield convolution). Equation 4 similarly defines Sy, 
the v-partials. The u-tangents are defined by the equation: 
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Equation 5 

Su =WSu-WuS 

5 The analytic normal direction is thus: 

Equation 6 

n= Su X Sv 

10 Diagram 1201 shows an alternative to diagram 1200 and produces a 

reasonable approximation in less time. The difference between diagram 1200 and 
diagram 1201 is shown in equation 7 below: 

Equation 7 

15 X "s^ = w2(su X Sv) + w(s X (SuWu - SyWy)), SO if SuWu - SvWv = 0, then Su X Sv 

Su X Sv and n ^ Su x Sv 

In accordance with the present invention, surface partials are computed by 
the tri-Unear filter (e.g., tri-linear filter 500). It should be noted that for surface 
20 partials, the surface partial of a point evaluated on a Bezier curve is equal to: 

Equation 8 

2uu - uu3 
2 

25 

Thus, equation 8 is computed by negating the second input to interpolator 507 in 
tri-linear interpolator 500 and using a LERP weight equal to 1/2. 



SGI15-4-453.00 



37 



April 21, 1997 



Figure 13A and Figure 13B show tri-linear interpolator 500 implementing 
surface partial evaluation in accordance with the present invention. It should be 
noted that the inputs to tri-linear interpolator 500 are the same as they were 
during the Bezier evaluation shown in Figure 8A (which is similarly the same for 
the convolution process described above). The output of tri-linear interpolator 500 
is the resulting partial vector (e.g., Su). 

The present invention subsequently uses the graphics pipeline hardware, 
specifically, a "blender" included within raster subsystem 304. Since, equation 5 is 
very similar to operations already implemented in the hardware of the blender, the 
present invention utilizes the blender to compute tangents from partials. The 
present invention uses a two step process. In the first step, the partials are 
evaluated into the frame buffer of raster subsystem 304 as a vertex mesh. In the 
second step, the surface points are fed back through the rasterizer where they are 
blended with the partial mesh. The blending ftinctions are arranged according to 
equation 9: 

Equation 9 

r =rs(Xd-rdas 

[x,y,z,w]T is sent as (rs,gs,bs,as) data and (rd,gd,bd,ad) is equal to [xu,yu,Zu,Wu] 
By using a subtract blending mode, the blender effectively computes: 
Equation 10 

XWu-XuW 
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Normals are also computed as cross products in the blender using a new blending 
factor referred to as a "cyclic shift". Cyclic shift source factors operate as follows: 

Equation 11 

r =rsgd-rdgs 
g =g3bd-gdbs 
b =bgrd-bdrs 
a=0 

The cross product desired is: 

Equation 12 

Hx = yuZv - yyZu 

Ily — Z^Xy " ZyX^ 

_nz = Xuyv - Xvyu. 
Instead, what is actually computed is: 

Equation 13 
[nznxUy] 

Thus, in accordance with the process of the present invention: 
Equation 14 

[xuyuZuy]'^ is sent as (rg, gg, bg, ag) [xyyvZyxlT is arranged to be Crd, gd, bd, ad) and 
thus, [nznxnyOjT is computed as desired. 

With reference now to Figure 14, flowchart the steps of a method 1400 in 
accordance with one embodiment of the present invention is shown. In step 1401, 
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a NURBS model is received for rendering. In the present embodiment, the 
NURBS model is generated by a software program executing on the CPU 
subsystem. As described above, the NURBS model represents a 3D object. 

5 In step 1402, the process of the present invention converts the NURBS 

model into a Bezier model. This involves converting the individual NURBS curves 
into corresponding Bezier curves. The present invention utiHzes the tri-linear 
interpolators included with the hardware of the graphics pipehne to implement the 
NURBS to Bezier conversion process. Each NURBS curve to be rendered is 
10 converted into equivalent Bezier curves as described above. This is accomplished 
by performing curve conversions in the u direction and then in the v direction. 

In step 1403, the process of the present invention generates points on the 
curve defined by the Bezier control points of the Bezier model. As described above, 
15 the Bezier control points are obtained via the NURBS to Bezier conversion 

process. The Bezier control points define a Bezier curve. The present invention 
uses the tri-linear interpolators to generate points on this curve. Again, this 
equation is performed in first the u direction and then in the v direction. 

20 In step 1404, the present invention compiites normals to the surface. As 

described above, the su and sy partials are generated using the tri-linear 
interpolators. From these partials, the and Sy tangents are derived using the 
blender hardware. From these tangents, the blender hardware is used to generate 
the normals to the surface. The normals are subsequently used to realistically 

25 render lighting effects on the surface. 
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In step 1405, the present invention renders the surface defined by the 
generated points and normals. As successive points are rendered, the graphics 
pipeline creates primitives (e.g., "quads") to render a surface. The curves in one 
direction (e.g., in the x direction) are rendered, the curves in another direction (e.g., 
the y direction) are rendered, creating a Une drawing. Alternatively, the curves are 
coupled to form a quadrilateral mesh defining a surface. 

Thus, the present invention provides a method and a system which greatly 
increases the speed of the graphics rendering process. The method and system of 
the present invention increases the speed and efficiency of the process of 
rendering NURBS models. The method and system of the present invention 
accurately renders NURBS models without bxordening the data transfer 
bandwidth of the computer system. In addition, the method and system of the 
present invention does not consume an inordinate amoimt of host processor clock 
cycles. In so doing, the present invention provides a fast and efficient process for 
rendering NURBS models. 

Referring now to Figure 15, a computer system upon which the present 
invention may be practiced is shown as 1500. Computer system 1500 includes 
any computer controlled graphics systems for generating complex or 3 
dimensional images. Computer system 1500 includes a bus 1501 for transmitting 
digital information between the various parts of the computer system. One or 
more microprocessor(s) 1502 are coupled to bus 1501 for processing information. 
The microprocessor(s) 1502 comprise CPU subsystem 302 shown in Figure 3. 
The information along with the instructions of how the information is to be 
processed are stored in a hierarchical memory system comprised of mass storage 
device 1507, read only memory 1506, main memory 1504, and static random 
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access memory (SRAM) 1503. Mass storage device 1507 is used to store vast 
amounts of digital data. The mass storage device 1507 can consist one or more 
hard disk drives, floppy disk drives, optical disk drives, tape drives, CD ROM 
drives, or any niamber of other types of storage devices having media for storing 
5 data digitally. A read only memory (ROM) 1506 is used to store digital data of a 
permanent basis, such as instructions for the microprocessors. Main memory 
1504 is used for storing digital data on an intermediate basis. Main memory 1504 
can be dynamic random access memory (DRAM). 

10 The 3D graphics rendering pipeline 301 is also included in system 1500. As 

described above, processor(s) 1502 provides the graphics rendering pipeHne 301 
with graphics data, such as drawing Commands, coordinate vertex data, and other 
data related to an object's geometric position, color, texture, shading, and other 
surface parameters. The display subsystem 305 displays the rendered image on 

15 monitor 1521. 

Several other devices may also be coupled to system 1500. For example, 
an alphanumeric keyboard 1522 is used for inputting commands and other 
information to processor 1502. Another type of user input device is cursor control 

20 device 1523 (a moiise, trackball, joystick, and the like) used for positioning a 

movable cursor and selecting objects on a computer screen. Another device which 
may be coupled to bus 1501 is a hard copy device 1524 (e.g., a laser printer) for 
printing data or other information onto a tangible medixma. Additionally, a sound 
recording or video option 1525 can be coupled to the system 1500 to provide 

25 multimedia capabilities. 
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The present invention, a method and system for efficiently drawing NURBS 
surfaces for 3D graphics, is thus disclosed. While the present invention has been 
described in particular embodiments, it should be appreciated that the present 
invention shoxild not be construed as limited by such embodiments, but rather 
construed according to the below claims. 
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A METHOD AND SYSTEM FOR EFFICIENTLY EV AT.TTATING AND 
DRAWING NURBS SURFACES FOR 3D GRAPHICS 



CLAIMS 
5 What is claimed is: 

1. In a computer system having a processor, a bus, and a graphics 
rendering pipeline for displaying 3D graphics on a display, a computer 
implemented method for rendering a curve or a surface, the method comprising 
10 the computer implemented steps of: 

a) receiving a NURBS model for rendering from a software program 
running on the processor of the computer system ; 

b) converting the NURBS model to a Bezier model using the graphics 
rendering pipeline; 

15 c) generating a plurality of points on a curve or siuface, wherein the curve 

or surface is defined by the Be2der model, using the graphics rendering pipeline; and 
d) rendering the ciirve or surface using the plurality of points and using the 
graphics rendering pipeHne. 

20 2. Tiie method of claim 1 wherein step a) further includes the step of 

receiving the NURBS model in the graphics rendering pipehne via the bus, wherein 
the NURBS model defines all of a curve or surface, or a portion of the curve or 
surface, to be rendered. 

25 3. The method of claim 1 wherein step b) further includes the step of 

generating a pluraUty of Bezier control points from a corresponding plurahty of 
NURBS control points. 
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4. The method of claim 3 fiirther including the step of using a tri-linear 
interpolator to generate the plurality of Bezier control points from the 
corresponding plurality of NURBS control points. 

5 

5. The method of claim 4 further including the steps of: 

using the plurality of NURBS control points as inputs to the tri-linear 
interpolator; and 

evaluating the NURBS control points to obtain each of the plurality of 
10 Bezier control points. 

6. Tlie method of claim 1 wherein step c) further includes the step of 
generating a plurality of points on the curve or surface using a plurality of Bezier 
control points. 

15 

7. The method of claim 6 fiorther including the steps of: 

using the plurality of Bezier control points as inputs to a tri-linear 
interpolator; and 

evaluating the plui-ality of Bezier control points to obtain the plurality of 
20 points on the curve or surface. 

8. The method of claim 1 further including the steps of: 
processing the plurality of points with the graphics rendering pipeline; 
rendering the curve or surface with the graphics rendering pipeline; and 

25 rendering the curve or surface with the graphics rendering pipeline. 
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9. In a graphics rendering pipeline of a computer system, a method for 
rendering curves or surfaces using the graphics rendering pipehne, the method 
comprising the steps of: 

a) implementing a de Casteljau process in the graphics pipeline; 
5 b) evaluating a Bezier curve or surface using the de Casteljau process; and 

c) rendering the Bezier curve or surface. 

10. The method of claim 9 further including the step of implementing the de 
Casteljau process using a tri-linear interpolator included in the graphics pipeline. 

10 

11. Themethodof claim 9 further including the steps of: 

loading inputs of the tri-linear interpolator with a plurality of control points 
of the Bezier cxirve or surface; and 

generating a plurality of points on the curve or surface using the tri-linear 

15 interpolator. 

12. The method of claim 11 further including the step of using the pltirality 
of points to render the Bezier ciorve or surface. 

20 13. In a "graphics rendering pipeline of a computer system, a method for 

converting a NURBS (non-uniform rational B-spline) curve or surface to a Bezier 
curve or surface using the graphics rendering pipeline, the method comprising the 
steps of: 

a) loading a pliarality of NURBS control points of a NURBS curve or 
25 surface into the graphics rendering pipehne; 

b) evaluating the plurality of control points into a resulting plurality of 
Bezier control points; 
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c) generating a Bezier curve or surface using the resulting plurality of 
Bezier control points; and 

d) rendering the Bezier curve or surface using a plxiraUty of vertices derived 
from the plurality of Bezier control points. 

5 

14. The method of claim 13 further including the steps of: 

loading the plxoraHty of NURBS control points into inputs of a tri-linear 
interpolator included in the graphics rendering pipeline; and 

evaluating the pluraHty of NURBS control points into the resulting 
10 plurality of Bezier control points using the tri-linear interpolator. 

15. The method of claim 13 further including the step of transforming the 
NURBS CTxrve or surface from a global domain to a local domain. 

15 16. In a graphics rendering pipeline of a computer system, a method for 

generating normal vectors (normals) for a surface, the method comprising the 
steps of: 

a) generating a pluraHty of surface partials from the surface using the 
graphics rendering pipeUne; 
20 b) generating a plTxrahty of surface tangents from the plurality of surface 

partials using the graphics rendering pipeline; and 

c) generating at least one normal from the plurality of surface tangents 
using the graphics rendering pipeline. 

25 17. The method of claim 16 further including the steps of: 

loading inputs of a tri-Unear interpolator included in the graphics rendering 
pipeline with a plurality of Bezier control points defining the surface; and 
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evaluating the plurality of Bezier control points to generate the plurality of 
sTirface partials. 

18. The method of claim 16 further including the step of generating the 
pluraUty of surface tangents from the plurality of surface partials using a blender 
included in the graphics rendering pipeline. 

19. The method of claim 18 fixrther including the step of generating the at 
least one normal from the plurality of surface tangents using the blender. 

20. In a graphics rendering pipeline of a computer system, a method of 
using the graphics rendering pipeline to render a curve or surface directly from a 
NURBS (non-uniform rational B-spline) model, the method comprising the steps 
of: 

a) performing a global to local transformation on a NURBS model using the 
graphics rendering pipeline; 

b) evaluating a plurality of NURBS control points using the graphics 
rendering pipeline to obtain a plurality of points on a curve or surface defined by 
the NURBS model; and 

c) rendering the curve or siuface using the plurality of points. 

21. The method of claim 20 further including the step of indexing at least 
one look up table within the graphics rendering pipeline to perform the global to 
local transformation of the NURBS model. 
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22. The method of claim 21 further including the step of evaluating the 
plurahty of NURBS control points using a tri-linear interpolator included in the 
graphics rendering pipeline. 

23. The method of claim 22 further including the step of indexing the at 
least one look up table with the graphics rendering pipeUne to obtain a plurality of 
parameters to configure the tri-Unear interpolator; 

24. The method of claim 23 further including the steps of: 
implementing a quadri-linear interpolator using said tri-Unear interpolator; 

and 

generating the plurahty of control points using said quadri-linear 
interpolator. 

25. The method of claim 20 wherein step b) further includes the steps of: 
using the graphics rendering pipeline to implement a convolution; and 
using the convolution to obtain the plurahty of points on the curve or 

surface. 
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A METHOD AND SYSTEM FOR EFFICIENTLY DR AWING NURBS 
STTT?.FACES FOR 3D GRAPHICS 



ATtSTRACT OF THE DISCLOSURE 
5 The present invention comprises a computer implemented process and 

system for rendering curves or surfaces as 3D graphics on a display. The system 
of the present invention includes a computer system having a processor, a bus, 
and a 3D graphics rendering pipeline. The curves or surfaces are modeled by non- 
nnifnrm rational B-splines (NURBS). The process of the present invention 

10 fiinctions by receiving a NURBS model for rendering from a software program 

running on the host processor. The NURBS model defines a curve or svirface. The 
process of the present invention efficiently converts the NURBS model to a Bezier 
model usir^ the hardware of the graphics rendering pipeline. The Be2der model 
describes the same ctirve or surface. The process of the present invention 

15 subsequently generates a plvirality of points on the curve or surface using the 

Bezier model and the graphics rendering pipeline. The points are then used by the 
graphics rendering pipeline to render the curve or surface defined by the Bezier 
model. Alternatively, a NURBS model is directly evaluated into a plurality of 
points on a curve or siarface, and in turn, rendered into the curve or surface. This 

2 0 direct rendering of the NURBS model is implemented using the graphics rendering 
pipeline. 
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